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Abstract 

We present a computational method for measuring financial risk by estimat- 
ing the Value at Risk and Expected Shortfall from financial series. We have 
made two assumptions: First, that the predictive distributions of the values 
of an asset are conditioned by information on the way in which the variable 
evolves from similar conditions, and secondly, that the underlying random 
processes can be described using piecewise Gaussian processes. The perfor- 
mance of the method was evaluated by using it to estimate VaR and ES for 
a daily data series taken from the S&P500 index and applying a backtesting 
procedure recommended by the Basel Committee on Banking Supervision. 
The results indicated a satisfactory performance. 
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1. Introduction 



Since the classic study by Haavelmo ( 1944 ) on the probability approach in 



econometrics, there has been a tendency to assume that a series of values of 
an asset, {V T \ r — 1, . . . , t}, represents a random process with density p T (V). 
In fact, from the point of view of financial practice, what is interesting is to be 
able to make predictions conditioned by a particular information set. Thus, 
what we really want to model is the density pt+i(V\It), conditional on the 
information I t known at t, from which the expected price of the asset: 



V t+ i= / V Pt+1 (V\I t )dV 
Jo 

and the deviations in these prices 



a t+1 = J ^ (V- V t+1 y Pt+1 {V\I t )dV 

can be calculated. 

The expected return, Rt+i = (Vt+i—Vt) /V t , can thus be obtained from the 
first integral, whilst the standard deviation is a measure of the uncertainty 
that is directly related to indexes commonly used in risk management, such 
as the Value at Risk (VaR) and Expected Shortfall (ES). The problem, of 
course, is that there is only one realization for each random process, thus 
the conditional distribution is an unobservable quantity that can only be 
inferred. 

In this context, this study has a double aim: firstly, to use (in a way that 
we will describe in detail later) information about the evolution of prices from 
similar initial conditions, as the information I t that conditions the distribu- 
tion we are looking for, and secondly, to design an inference model based on 



the assumption that the underlying random processes are piecewise Gaussian 
processes. 

Due to the simplicity and versatility of Gaussian Processes (GP) for the 
modeling of arbitrary functions, they have become the basis for several tech- 
niques that have been developed to analyze several types of spatial and tem- 



poral problems (see e.g. Brahim-Belhouari and A. Bermak (2004), Bukkap- 



atnam and Cheng (2010), Deisenroth et al. (2009), di Sciascio and Amicarelli 



(2008), Gosoniua et al. (2009), Kim et al. (2005), Ko and Fox (2009) and 



Wang et al. (2008)). In addition, modeling with GP has an advantage that 



makes it particularly attractive from an econometric point of view, which 
is that it does not only permit predictions of the values of a series to be 
made, but also generates predictive distributions. This means that both the 
volatilities (given by the standard deviations of the predictive distributions) 
and indexes of risk such as VaR can be estimated directly. Nevertheless, to 
the best of our knowledge, this approach has not been previously used for 
the estimation of financial risk indexes. 

Furthermore, our GP based models are local in the sense that the involved 
parameters depend on the information I t , that in turn changes with time. 
As a consequence, the conditional predictive variances are time dependent 
and the models are thus heteroscedastic. In contrast to the ARCH/GARCH 
type models, however, there is no equation that controls the evolution of 



the volatility (see Engle et al. (2008)), and this change is instead driven by 
changes in I t . 

In the following section we introduce the specific type of piecewise models 
that we apply to estimate the risk in a financial time series. In section III we 
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apply a validation procedure (backtesting) to evaluate the model according 
to recommendations given by the Basel Committee on Banking Supervision 
dBasel I| ( p95| , |Basel II| ( [1996] ) and |HuII| ( |2007| )). Concluding remarks are 



presented in Section IV. 

2. Prediction of the Value at Risk and Expected Shortfall using 
Piecewise Gaussian Processes 

In what follows, we consider the simple case where we wish to make an 
estimation, p t+ i(y\I t ), of the conditional distribution p t+ i(V\It) for a known 
series of the values of an asset V\, V 2 , . . . , V t , in order to predict the return 
one step ahead: 



oo 



R t+1 = (V t+1 /V t ) - 1; V t+1 = / Vp t+1 (V\I t )dV (1) 

Jo 

and the volatility measured by the deviation: 



*£i ^^~ = V t \jJ {V - Vt + i) 2 Pt + i(V\I t )dV (2) 

Specifying the information by which it is conditioned is therefore essen- 
tial. We thus assume, as for the majority of the models used in financial 
econometrics, that events which occurred most recently are of greater rele- 
vance when we wish to establish conditions in the near future. This is the 
same as assuming that I t is contained in the sequence of values of the recent 
past: Vt-i+i, V t -i+2, ■ ■ ■ j Vt-i, Vt, corresponding to a rolling window of size 
/. Part of the original contribution of our proposal has to do with the way 
in which I t is extracted from this data window. In this sense, our specific 
interest is focused on using the values of the asset which follows patterns of 
behavior of the data that are similar to those observed in the current window. 



In order to explain in more detail what we mean, let's start by considering 
all possible segments of historical information that can be constructed with 
I consecutive values of the prices of an asset: 

vi = (V^,...,^) 
v 2 = (V 2 ,V 3 ,...,V l+1 ) 

vt-i+i = (Vt-i+i, . . . ,V t -i,V t ) 
The behavior patterns are understood to be the standardizations: 

zv r = (zV T ,zV T +i, . . . ,zV T+ i-i); zV s = — — — ; t<s<t + 1-1 

(3) 



S=T \ S=T 



T+l-1 

-VrY 



where I < t < t — I + 1, such that even when the scales of the asset values 
may be very different between windows, the patterns are scale free profiles 
(see Figure 1). 

Lets look now at the set of the k /-dimensional behavior patterns that are 
most similar, as regards to their Euclidean distances, to the current pattern 
zv t _;+i. Thus we need to identify k vectors that correspond to the smallest 
values of: 

D t -i+i, T = ||zv t _ i+1 - zv T || 
and place them in increasing order: 

ZV ni(M ),ZV n2 ( M ), . . . ,ZV nk (t,l) 

where A-i+i,m(t,/) < A-«+i,n 2 (*,0 < • • • < Dt-i+i,n k (t,i)- 
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Figure 1: An example of what results from the process of seeking patterns similar to the 
current pattern. Standardization permits us to compare each past pattern with the current 
pattern without taking into account the scale. Thus, what we use as inputs for the local 
GP are similar profiles. 
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Now we construct, by appropriately scaling each V ni ^ + i observed one 
step ahead of the i-th pattern, the respective standardized values of the asset 
which correspond to each of the patterns that define the neighborhood of the 
current pattern: 



zv ni ( t ,i) — > zV ni (t,i)+i 



Vm(t,i)+l ~~ H»i(t,0 



1 < i < k 



(4) 



What we assume as being the information I t , is the set of ordered pairs: 



where 



x(t,z) = {xi(*,o,i/i(M); 1 < k h 



Xi(t,l) = zv n! ( y) , yi{t,l) = zV ni ^i )+ i 



(5) 



(6) 



The second step in our method is to adjust the hyperparameters of the 
GP represented by a set of random variables f t {xi(t,l)) and a noise term 
that satisfy the local condition: 



yi(t,0 = /t(xi(M))+ei| e* ~ z = l,2 



(7) 



which we then use to generate the conditional predictive distribution 

2(2,*(t,0|X(t,Z),x*(t,0) 

where x*(t,l) = zv t _/ + i. 

By following the standard procedure used in these cases and that is de- 



scribed in the relevant literature (see e.g. Brahim-Belhouari and A. Bermak 



(2004), Kim et al. (2005) and Rasmussen and Williams (2006)), we obtain 

(8) 



that: 



g(y*(t,l)\yL(t,l),x*(t,l))<xexp 



(y*(t,l)-y(t,l)y 



2a* 2 (t,l) 
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where y(t,l) can be identified as the expected value of y*(t,l) and a* 2 (t,l) 
with the variance, which are given respectively by: 



y(t, I) = c(t, l)C-\t, l)y(t, Z); a* 2 (t, I) = 7 (f , /) - c(t, l)C-\t, l)c T (t, I) (9) 



with: 



c(f , I) = (c(x*(t, Z), Xl (t, Z)), c(x*(t, /), x 2 (t, /)),..., c(x*(t, Z), x fc (t, /))), (10) 
C _1 (i,Z) is the inverse matrix of: 



/ 



c(xi(t,Z),Xi(i,Z)) c(xi(t,/),x 2 (t,Z)) ... c(xi(t,Z),x fc (t,Z)) 
c(x 2 (t,/),x 1 (t,Z)) c(x 2 (*,Z),x 2 (*,Z)) ... c(x 2 (t,/),x fc (t,/)) 



^ c(x fc (t,Z),xi(t,Z)) c(x fc (i,Z),x 2 (*,Z)) ... c(x fc (f,Z),x fc (f,Z)) y 



y 2 (M) 



(ii) 



y(M) 



; 7 (M) = c(x*(t,Z),x*(t,Z)) 



(12) 



the superindex T denotes transposition, and c(a, b) is the covariance function 
defining the CP, here assumed as: 

1 



c(a, b) = Q\ exp 
where M 4 is the diagonal matrix: 



(a-b) J M t (a-b) 



(13) 



1/8 








V i/4-/ 



In order to predict the asset value V t+ \, we take the inverse of the standard- 
ization process on the expected value y(t, I): 

v t +i = <Tv t - l+1 -y{t,i) + v^ l+1 , (14) 

while the prediction for the uncertainty a t+ i is: 

a t+ i = a Vt _ l+1 ■ a* (15) 

It is worth emphasizing the local character of the proposed scheme, in the 
sense that in order to obtain the predictive distribution at t + 1 we use a GP 
whose hyperparameters (z/ t 2 , Q\ , s u , s 2 t, • • • , %) are fitted to the information 
I t . This itself is dependent on time, as indicated by Eq. [3](6j causing the 
GP parameters to change, providing a sufficiently flexible method for the 
modeling of non stationary series. As regards to the manner in which the 
parameters that intervene in the co- variance matrix are fixed, a criteria that 
is frequently used is to maximize the log-likelihood: 

C = - l -log[det{C{t,l))] - ^y(t,/) T C- 1 (t,/)y(t,/) - Uoglit. (16) 

Furthermore, as indicated by the result of Eq. [8j when we apply models 
based on GP we obtain a predictive distribution, which makes it relatively 
simple to estimate the probability that a given event will occur. This fact is 
extremely interesting, from the financial point of view, in decision making. 
In particular, we show how models based on GP permit a straightforward 
evaluation of both VaR and ES. 

We start by calculating the Value at Risk for a time horizon T and prob- 
ability a G (0, 1). The usual definition is: 

VaR? +T = sup{v ; P[{R t+T < v)\I t ] < a} (17) 
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where P(R\I t ) is the conditional distribution of the probabilities of the asset 
returns and It is, as before, the information that conditions what occurs 
at t + T. In other words, VaR is a simple way to estimate the minimal 
potential loss that, for a given time horizon, will be contained in the 100a;% 
of the worst case scenarios. 

VaR may be estimated directly with the method we have described: we 
only have to calculate the return corresponding to the value, V£jL 1( below 
which the price of the asset may be found with probability a. The predictive 
distribution of the prices is given by the GP, thus V£Li satisfies the condition: 



a 



1 

Z 



exp 



(x - v t+1 y 



dx 



where Z is the normalization factor: 



exp 



dx 



O't+l 



1 + erf 



and erf(x) is the error function: 



erf(x) 



exp (— x 2 )dx 



Combining Eq. [18] and Eq. 19 we obtain: 

V t+1 



erf 



y/2- 



111 

aw —(7t+i 



- erf 



1 + erf 



V t+ i - V; 



t+i 



y/2, 



O't+l 



t+i 



and therefore 



erf 



Vt + i - Vfa 



y/2- 



O't+l 



;i - a)erf 



Vt+i 
y/2d~t+i 



t+i 



y/2, 



O't+l 



aZ 



a: 



(18) 



(19) 
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or 



V t a +l = V t+ i - V2a t+1 erf-\\ t+1 ) (20) 

with 

\ t+1 = (l- a )erf(^-) -a (21) 
V V 2o-f+i I 



According to Eq. 20 , our estimation of VaRf +1 will be given by: 

Va~R t+1 = R t+l - ^p^erf-\\ t+1 ) (22) 
where R t+ i and a t+ i are evaluated using Eq. [I 14 and 15, and A t+ i is given 



by Eq. 21 



We will now estimate ES with our model. If pt+i(R\It) is the conditional 
distribution of the returns of an asset and VaRf +1 the respective value at 
risk for a level a, then the expected shortfall, ES, is: 

ES? +1 = E{R t+l \I t , {VaR« +T > R t+1 )) oc I ^ Rp t+1 (R\I t )dR (23) 

J — oo 

such that, while VaR gives information about the least loss we could expect 
at a given confidence level; ES informs the worse expected loss that we can 
expect to occur. 

In those cases where hedging needs to be designed, an adequate estimate 
of the ES is more informative than VaR. Lets take as a hypothetical sit- 
uation that at a particular time t we know that the VaR®+\ of an asset or 
a portfolio is —0.001. There is thus a 99% probability that the return will 
exceed this value. However, the hedging plan will change if we estimate that 
the expected return for the remaining 1% is either —0.002 or —0.1. 

As for VaR, it is relatively simple to estimate ES using our GP based 
model. Effectively, since the predictive distributions of the prices are Gaus- 
sian, so are those of the arithmetic returns (given that these are obtained 
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from the prices by dividing them by yesterdays price). Thus, we can eval- 
uate the previous integral by calculating the expected value ESV t+1 of the 
price of the asset, as long as this is below the 100a% quantile of the predictive 
distribution of the prices V t + 1 

v t+i 



ESV 



t+i 



t+i Jo 



xexp 



[x 



V t+ i) 



dx, 



where C^ +1 is the normalization factor: 



C: 



t+i 



cxp 



[X 



V 



t+ij 



dx 



erf 



V t+1 



y/2, 



erf 



Vt + i ~ V t % i 



The integral in Eq. [24] can be readily evaluated: 

(V t a +1 -Vt+i) 2 



ESV 



t+i 



V t 



t+i 



cxp 



2 a 



~7T 



cxp 



y t+l 



2 a 



*7 
t+1 



erf 



Vt4 



V2 



f t+i 



erf 



Vt+i-V t a +i 



(24) 



and from this result we get our estimation for ES t+l : 
ESV* 



ES 



t+i 



t+i 



V t 

2 ot+i 
vr V t 



R 



t+i 



cxp 



(y t a +1 -v t +if 



2 



exp 



2 -hi 



erf 



V2 . 



erf 



V2 



,ft+i 



(25) 



In the next section we evaluate the performance of the model by showing 

-—-0.01 . ^.o.oi 

the results obtained by calculating VaR t+1 following Eq. 



22 



and ES. 



t+i 



according to Eq. 25 , for a series taken from the S&P500 index. 
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3. Backtesting VaR and ES estimations 

The set of recommendations established by the Basel Committee is cur- 
rently used to measure the quality of VaR estimates. These are statistical 
validation measurements (backtesting) based on the quantification of excep- 
tions: 

e? +1 = Q(Va~R* +1 - R t+1 ) 

that can occur when estimating VaRf +1 , where Q(x) is Heavisides step func- 
tion = 1(0) when x > (in any other case)). 

Whichever the method used for calculating VaR, the condition that must 
be satisfied is: 

E(e? +1 \I t ) = a 

and what is examined are the different consequences of this condition. 

Probably, the most utilized format is that if there are a total of n calcu- 
lated values, the total number of exceptions: 

n-1 

x — ^ e *+! 
t=0 

follows a binomial distribution with parameters n and a. Thus for a 5% 
confidence level for n = 250 evaluations (that for daily data represents ap- 
proximately one year) and a value of a = 0.01, the method will only be 
rejected when x > 5. 

For the example of the series taken from the S&P500 index represented 



- — -o.oi 

in Figure 2, we can use Eq. 22 to calculate Var t+1 over three consecutive 



windows of n — 250 data points (i.e. approximately three continuous years) 
and not once do we obtain more exceptions than those considered acceptable 
according to the validation criteria (3, 5 and 5 exceptions, respectively). 
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t+l -0.5 
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Days 



Figure 2: In the upper part of the figure a segment representing the daily returns of the 
S&P500 index over a period of three years is shown. Observe the period of high volatility 
characterized by large fluctuations in the returns. The inferior part of the figure shows our 
VaRt+l estimates. 
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As regards the evaluation of ES estimates done with the model, for each 
of the exceptions (total of 13) we compared the ES that we get using our 
model with the observed returns. Specifically, if we use if, % — 1, . . . , 13 to 
denote the 13 values of the times that correspond to the exceptions, we can 
calculate their mean squared error: 




This gives an error of 0.12, indicating the quality of the fit achieved by our 
model. 

4. Discussion 

There are several reasons why the behavior of financial markets is diffi- 
cult to model, above all, because each agent involved prefers that others do 
not have access to a model that permits them to correctly anticipate fluctu- 
ations in the market. If they did, and could thus exchange uncertainty for 
predictability, circumstantial informational advantages that could be used to 
increase net worth would be lost, which at the end of the day is what this is 
all about. 

Apart from practical motivations whose importance is obvious, the diffi- 
culties inherent to modeling combined with the ever increasing possibility of 
having access to large volumes of data and computer software, have stimu- 
lated the development of empirical models that, as well as predicting what 
the system will or will not do, also attempt to estimate the degree of uncer- 
tainty that this may present. 
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Models based on Gaussian processes incorporate this option in a direct 
way as a consequence of their structure. This is advantageous due to the 
simplification of the theory involved and also from a practical point of view. 
They do this by assuming that the series of values of an asset is a realization 
of a random process and that this is Gaussian. Thus, predictive conditional 
distributions that are Gaussian centered on the expected values of the asset 
are obtained, whose standard deviations fix the respective volatility values 
and whose quantiles correspond to estimates of risk indexes such as VaR and 
ES. All this without the necessity of introducing additional assumptions, 
such as for example, the evolution of the conditional variance and/or the 
distribution of the observed returns, which are necessary in ARCH/GARCH 
type models and in models of stochastic volatility. 

Another beneficial aspect is the ease with which we can include the as- 
sumptions as relevant information that condition the predictive distribution 
in each case. One of these benefits can be when we suppose that the infor- 
mation is given by the way in which the system evolves from similar initial 
situations. The problem is thus reduced to specifying the set of vectors 
that this information represents and to evaluate the corresponding covari- 
ance function. 

With regards this last point, it is woth mentioning that even though 
all the results we have presented correspond to a particular selection of the 
covariance function, the key issue is that whatever the proposed function is, 
this generates a covariance matrix whose elements are much higher among 
vectors that are similar than among those that are not, which falls rapidly 
with the distance between vectors. Finally, the models presented here permit 
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the establishment of simple criteria for risk evaluation that could be useful in 
the design of dynamic portfolios with optimal risk, which could in turn result 
in portfolios that generate higher equity than that produced by minimum risk 
portfolios. These are two of the aspects we wish to focus on in a further study. 
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